Improving Name Tagging by Reference Resolution and Relation Detection
نویسندگان
چکیده
Information extraction systems incorporate multiple stages of linguistic analysis. Although errors are typically compounded from stage to stage, it is possible to reduce the errors in one stage by harnessing the results of the other stages. We demonstrate this by using the results of coreference analysis and relation extraction to reduce the errors produced by a Chinese name tagger. We use an N-best approach to generate multiple hypotheses and have them re-ranked by subsequent stages of processing. We obtained thereby a reduction of 24% in spurious and incorrect name tags, and a reduction of 14% in missed tags.
منابع مشابه
Corefrence resolution with deep learning in the Persian Labnguage
Coreference resolution is an advanced issue in natural language processing. Nowadays, due to the extension of social networks, TV channels, news agencies, the Internet, etc. in human life, reading all the contents, analyzing them, and finding a relation between them require time and cost. In the present era, text analysis is performed using various natural language processing techniques, one ...
متن کاملRule-based reference resolution for unrestricted text using part-of- speech tagging and noun phrase parsing
This paper describes an experimental syntactic rule-based method for reference resolution in unrestricted texts. References can be resolved automatically and this overcomes a major hurdle in text analysis and provides a key advantage in text `understanding' and information extraction. A shortcoming of systems that locate and extract sentences from unrestricted text to help people assimilate inf...
متن کاملA New Method for Improving Computational Cost of Open Information Extraction Systems Using Log-Linear Model
Information extraction (IE) is a process of automatically providing a structured representation from an unstructured or semi-structured text. It is a long-standing challenge in natural language processing (NLP) which has been intensified by the increased volume of information and heterogeneity, and non-structured form of it. One of the core information extraction tasks is relation extraction wh...
متن کاملDefinite Description Resolution in Spanish
In this work, a method for the resolution of the identity co-reference produced by definite descriptions in Spanish texts is presented. This method is based on the linguistic knowledge acquired by POS-tagger, synonymous dictionary and relationships between names and verbs resources. This method uses a system of restrictions and preferences in order to obtain the correct antecedent. The method a...
متن کاملImprovement of Breast Cancer Detection Using Non-subsampled Contourlet Transform and Super-Resolution Technique in Mammographic Images
Introduction Breast cancer is one of the most life-threatening conditions among women. Early detection of this disease is the only way to reduce the associated mortality rate. Mammography is a standard method for the early detection of breast cancer. Today, considering the importance of breast cancer detection, computer-aided detection techniques have been employed to increase the quality of ma...
متن کامل